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We explore various estimators for the parameters of a pair-copula construction (PCC), among 
those the stepwise semiparametric (SSP) estimator, designed for this dependence structure. 
We present its asymptotic properties, as well as the estimation algorithm for the two most 
common types of PCCs. Compared to the considered alternatives, that is, maximum likelihood, 
inference functions for margins and semiparametric estimation, SSP is in general asymptotically 
less efficient. As we show in a few examples, this loss of efficiency may however be rather low. 
Furthermore, SSP is semiparametrically efficient for the Gaussian copula. More importantly, 
it is computationally tractable even in high dimensions, as opposed to its competitors. In any 
case, SSP may provide start values, required by the other estimators. It is also well suited for 
selecting the pair-copulae of a PCC for a given data set. 

Keywords: copulae; efficiency; empirical distribution functions; hierarchical construction; 
stepwise estimation; vines 

1. Introduction 

The last decades' technological revolution have considerably increased the relevance of 
multivariate modelling. Copulae are now regularly used within fields such as finance, sur- 
vival analysis and actuarial sciences. Although the list of parametric bivariatc copulae is 
long and varied, the choice is rather limited in higher dimensions (Genest et al. [15]). Ac- 
cordingly, a number of hierarchical, copula-based structures have been proposed, among 
those the pair-copula construction (PCC) of Joe [22], further studied and considered by 
Bedford and Cooke [2, 3], Kurowicka and Cooke [30] and Aas et al. [1]. 

A PCC is a treelike construction, built from pair-copulae with conditional distributions 
as their two arguments (see Figure 1). The number of conditioning variables is zero at 
the ground, and increases by one for each level, to ensure coherence of the construction. 
Despite its simple structure, the PCC is highly flexible and covers a wide range of complex 
dependencies (Joe et al. [25], Hobask Haff et al. [21]). After Aas et al. [1] set it in an 
inferential context, it has made several appearances in the literature (Fischer et al. [13], 
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D-vine C-vine 

Figure 1. Five-dimensional D-vine (to the left) and C-vine (to the right). 



Chollcte et al. [6], Hcinen and Valdcsogo [19], Schirmacher and Schirmacher [35], Czado 
and Min [9], Kolbj0rnsen and Stien [29], Czado et al. [10, 11]), exhibiting its adequacy 
for various applications. 

Regardless of its recent popularity, estimation of PCC parameters has so far been 
addressed mostly in an applied setting. The aim of this work is to explore the properties 
of alternative estimators. As the PCC is a member of the multivariate copula family, 
one may exploit the large collection of estimators proposed for that model class, such as 
moments type procedures, based on, for instance, the matrix of pairwise Kendall's tau 
coefficients (Clayton [8], Oakes [34], Genest [14], Genest and Rivest [17]). Such methods 
may be well-suited for particular copula families. We are however interested in more 
general procedures, allowing for broader model classes. Moreover, we wish to exploit the 
specific structure of the PCC. 

More specifically, the number of parameters of a PCC grows quickly with the dimen- 
sion, even if all pair-copulac constituting the structure are from one-parameter families. 
In medium to high dimension, the existing copula estimators may simply become too 
demanding computationally, and will at least require good start values in the optimisa- 
tion procedure. Furthermore, due to the PCC's tree structure, selection of appropriate 
pair-copulae for a given data set must be done level by level. Procedures that estimate 
all parameters simultaneously are therefore unfit for this task. 

In all, we contemplate four estimators. The first is the classical maximum likelihood 
(ML), followed by the inference functions for margins (IFM) and semiparametric es- 
timators, that have been developed specifically for multivariate copulae. These three 
estimators are treated in Section 2, and are included mostly for comparison. Section 3 is 
devoted to the fourth one, the stepwise semiparametric estimator (SSP). Unlike the oth- 
ers, it is designed for the PCC structure. Although it has been suggested and used earlier 
(Aas et al. [1]), it has never been formally presented, nor have its asymptotic properties 
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*: i = l,...,d- j, **: i = 0,...,d-j. 

been explored. In Section 4, we compare the four estimators in a few examples. Finally, 
Section 5 presents some concluding remarks. 

The setting is as follows. Consider the observations xi,...,x„ of n independent d- 
variate stochastic vectors Xi,...,X n , originating from the same pair-copula construc- 
tion. Assume further that the joint distribution is absolutely continuous, with strictly 
increasing margins. The corresponding copula is then unique (Sklar [37]). Letting a and 
9 denote the parameters of the margins and copula, respectively, the joint probability 
density function (p.d.f.) may then be expressed as (McNeil et at [32], page 197) 



Here, Fi and I = 1, . . . , d, are the marginal cumulative distribution functions (c.d.f.s) 
and p.d.f.s, respectively, and c\...d is the corresponding copula density. Since this is a 
PCC, c\...d is, in turn, a product of pair-copulae. 

Define the index sets Vij ={i + l,...,i+j — l}, Wij = {i,Vij,i+j}, for 1 < i < d— j, 1 < 
j < d — 1, with Vn = 0, and = i- Thus, for a vector a = (ai, . . . , ad), we write a. Vij = 
(<Zi+i, . . . , Oi-if-j-i) and a Wij = (a,, . . . , dj+j)- Further, for an index k and a set of indices 
v, with k (j: v, let Fu v be the conditional c.d.f. of given X„ = x„, and Ci t i+ji Vi . the 
copula density corresponding to the conditional distribution F^ + j\ Vi . of (Xi, Xi + j) given 
= x Vi ,. Finally, let i-i+ j\ Vij be the parameters of the copula density Cij + j\ Vij , and 
define 0, ., , = {9 s ,s+t\v Bt - (s,s + t) G Wy}, with 6^ = 0, and 8j = {6 s , s +t\v st - \v a t \ = 
j — 1}, where | ■ | denotes the cardinality (i.e., 6j gathers all parameters from level j of 
the structure). Table 1 gives an overview of the notation. For a so-called D-vine (Bedford 
and Cooke [2, 3]), the joint p.d.f. (1) can now be written as (Aas et al. [1]) 

fi...d(xi,...,Xd;a,6) 



d 



f\...d{xi 



,...,x d ;a,9) = c 1 ...d(F 1 (xi;a 1 ), . . . ,Fd{x d ;a. d );0)Y\_.fi(xi;ai). (1) 



d d-ld-j 



Jfl(xv, ai )l 




(2) 



1=1 3=1 i=l 



^i+j\Vij l x ' 
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In four dimensions, this becomes 

fi234,(xi,x 2 ,x 3 ,X4,;oL,9) = f\(x\\a.\) ■ f 2 (x 2 ;a 2 ) ■ f 3 (x 3 ;a 3 ) ■ f^x^on) 

■ c 12 {Fx{x 1 ;a. 1 ),F 2 (x 2 ]oi 2 );6x 2 ) 

■ c 2 3 {F 2 (x 2 ; 0L2), F3 (X3; 0:3); 23 ) 

■ 234(^3(^3; 0:3), ^4(^4; 0:4); ^34) (3) 
• ci3\2(F 1 \2(xi\x 2 ;a 1 ,cx2, 612), F 3 \ 2 (x 3 \x 2 ; a 2 , a 3 , 23 );6 13 \ 2 ) 

■ c 24|3(^ 7, 2|3(^2|^3; " 2 , "3, #23), ^4|3 (^4 1^3 5 "3, «4, #34); #24|3) 
' Cl4|23 (^1|23(^1 1^2, 2:3; «1, «2, "3, #12, #23, #13|2), 

^4123(^41x2, x 3 ; a 2 , a 3 , a 4 , #23, #34, #24|3); #i4|23)- 

For simplicity, we will start by assuming that the distribution in question is a D-vine, 
represented to the left in Figure 1 for rf=5. Similar results can be obtained for C- vines 
(Section 3.3) and other regular vines (Bedford and Cooke [2, 3]). We also assume that 
the PCC is of a simplified form (Hobaek Haff et al. [21]), that is, that the parameters 
9i.i+j\vi- of the copulae C iti+ j\ v . . , combining conditional distributions, are not functions 
of the conditioning variables . . Without this assumption, inference on these models is 
not doable in practice. 

2. Multivariate copula estimators 

As previously mentioned, a PCC is a multivariate copula. Hence, one may estimate 
its parameters with well-known methods, such as maximum likelihood or the two-step 
inference functions for margins and scmiparametric estimators. 

2.1. Maximum likelihood (ML) estimator 

Supposing the model is true, the ML estimator is a natural choice, due to its asymptotic 
efficiency and other advantageous characteristics. According to (2), the log- likelihood 
function of a D-vine is given by 

n 

l{a, 8;x)=^2 log(/i...d(xifc, . ..,x dk ;a, 6)) 

fe=i 

n d 

fe=i 1=1 

n d—1 d—j 

c i,i+j\vij 0'i-H+J-l), (4) 

k=l j=l 1=1 
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= Z A /(«;x) + i c (a,0;x), 

where x = (xx,...,x n ). The ML estimator ML is obtained by maximising the 
above log-likelihood function over all parameters, 6 and a, simultaneously. Under 
the additional assumptions (M1)-(M8) of Lehmann [31] (pages 499-501), this cor- 
responds to solving the set of estimating equations (one equation per parameter), 
— J2k=i ML (^ifci ■ • ■ ? ^dk\ cc ML , ML ) = 0, which is a vector of functions, with elements 

i ml / n-, dlog(f 1 ... d (x 1 ,...,x (i ;a,d)) 
(j> l {x 1 ,...,x d ;a,B) = — , 

^ (5) 
,ML / a , _dlog(fi... d (x 1 ,...,x d ;a,e)) 

<Pd+{j-l)(d-j/2)+i\ x lT ■ ■ > x d;OL,V) — qTj 

for I = 1, . . .,d, i = l,...,d — j, j = 1, . . . ,d — 1. Define X as the corresponding Fisher 
information matrix 



X = E 



= E 



aiog(A... d (X; a, 6)) \ /91og(/ 1 ... (i (X; a, 0)) ^ 



d(a,0) J\ d{a,9) 
d 2 \og(f 1 ... d (X;a,8))\ _ f Z a T a>{ 



d(a,0)d(a,9)T J {Tie X 



In the last expression, it is partitioned according to marginal and dependence parameters. 
The corresponding inverse is 



X" 1 



(x(Q , e)) T x (0) 



xW = (x a -x M x e - 1 x^)- 1 , 

(6) 

f(a,8) _ -t-(q)-t- 7— 1 

It is well-known that under the mentioned conditions, the estimator ML is consistent 
for and asymptotically normal, that is, 

^(0 ML -0) ^AA(0,V ML ), 

yML = X W. 

In general, ML estimation of PCC parameters will require numerical optimisation. Even 
in rather low dimensions, such as four or five, the number of parameters is high if several 
of the model components have more than one parameter. For instance, a five-dimensional 
PCC, consisting of Student's £-copulae, has 20 parameters, to which one must add the 
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ones of the margins. Finding the global maximum in such a high-dimensional space is 
numerically challenging, even with more elaborate optimisation schemes, such as the 
modified Ncwton-Raphson method with first and second order derivatives. It will in any 
case be highly time consuming, so the ML estimator may not be an option in practice. 
Therefore, one needs faster and computationally easier estimation procedures. 

Moreover, the above results require the chosen model to be the true model, that is, the 
one that produced the data. If the specified model is close to the truth in the Kullback- 
Lciblcr (KL) sense, the ML estimator may behave very well (Claeskens and Hjort [7]). 
However, it is in general non-robust to larger KL-divcrgcnccs from the true model. 



2.2. Two-step estimators 



The next two estimators are not particularly designed for pair-copula constructions, 
but for multivariate copula models in general. Both consist of two steps, the first being 
estimation of the marginal parameters. 

2.2.1. Inference junction for margins (IFM) estimator 

The IFM estimator, introduced by Joe [23, 24], addresses the computational inefficiency 
of the ML estimator by performing the estimation in two steps. First, one estimates a 
by maximising the term Im from (4). The resulting estimates a IFM are plugged into the 
term l c to obtain IFM . 

Under conditions (M1)-(M8) (see Section 2.1), this corresponds to solving 



-^^ IFM (X 1 ,,...,X (i ,;a IFM ,^ 

fe=i 



with elements 



,ifm, „ s d\og{fi(xr,ai)) 



dcti 

(7) 

,ifm / m dlog(f 1 ... d (xi,...,x d ;a,d)) 
( Pd+U-i)(d~]/2)+iK x i^ ■■,x d ;ot,v) - ™ 

ua i,i+j\vij 

for I = 1, . . . , d, i = 1, . . . , d — j, j — 1, . . . , d — 1. Compared to the ML equations (5), the 
full log-p.d.f., log/i...d, is replaced with the marginal log-p.d.f.s, log/j, for the estimation 
of a. 

Consider a four-dimensional D-vine (3), consisting of Student's t-copulae, each 
having their own correlation and degrees of freedom parameter, combined with 
Student's i-margins. The parameter vectors are then a = (vi,!^, 1/3,1/4) and 6 = 

(Pl2,P23,P34,Pl3|2,P24|3 J Pl4|23 ) I / 12,^23,I / 34 ! ^13|2,I / 24|3 > I / 14|23)- IFM estimation of this 

model starts with a separate estimation of Vi, i = 1,2,3,4, margin by margin. The next 
step is to optimise ?c(Aij • • • , 1/4, 9; x) over 6, lc being the sum of the log-copula densities 
in line 3 to 8 of (3), over all observations. 
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Define the matrix K, a with 

K-a,i,j = E(</>J FM (X i ;a 4 )</)J FM (X : ,;a J )) 

= E f/ dlog(f i (X i ;a i )) W d log(/ 3 {Xj ; a, ) ) 
V V da.i ) \ dOLj 

and the block diagonal matrix J7" Q with 

<t vf 9 jJXMfY v>{ ^log^/i^aO)^ 

«7«,<,i = e ( - ^ 0, (x ( ,oti)j =e — j = /c Q , M , 

each block corresponding to one of the margins. If all margins are one-parameter families, 
7C a and 3 a are d x d matrices. More generally, their dimension depends on the number 
of parameters of each margin. 

Joe [24] showed that under the mentioned conditions, the estimator Q lFM is consistent 
for 9, as well as asymptotically normal: 

V^(0 IFM -0) ^AA(0,V IFM ), 

(8) 

-irlFM _ .7— 1 | -7— 1-7-T /y- 1 /T-l-T- -7— 1 V 

V - ± g +± g -L a gJ a K- a J a -L a fi-L 6 ■ 

The above covariance matrix is obtained by replacing 1^ in (6) with the asymptotic 
covariance matrix 1 /C a Jf^ 1 of a IFM . This quantifies the loss of asymptotic efficiency 
from discarding information the dependence structure might have on the margins. Sev- 
eral studies, including Joe [24] and Kim et al. [27], have demonstrated that unless the 
dependence between the variables is extreme, this loss tends to be rather small. That is 
also the impression from Examples 4.2 and 4.3 (Section 4). Moreover, the IFM method is 
computationally faster than the ML estimator, and can at least be used to set the start 
values in ML optimisation. Of course, for high-dimensional 0, the IFM estimator is still 
too slow to be used for PCCs. 

2.2.2. Semiparametric (SP) estimator for copula parameters 

Just like IFM, the SP estimator is a two-step estimator, treating the margins separately. 
It was introduced by Genest et al. [16], and for the censored case by Shih and Louis [36]. 
Later, it was generalised by Tsukahara [40]. Aas et al. [1] suggest this estimator for PCCs. 

As seen from (2), the pair-copula arguments at the ground level of a pair-copula con- 
struction (Ti in Figure 1) are marginal distributions Ft. From the second level, they are 
conditional distributions, whose conditioning set increases by one with each level. These 
conditional distributions may however be written as functions of the margins. Let i,j be 
distinct indices, that is, i ^ j , and v a nonempty set of indices, all from {1, . . . , d}, such 
that i,j ^ v. Then, in a simplified pair-copula construction (Joe [22]) 




Ui=F f \ v (xi |x„),«3=Fji„(ajj |x„) 



(9) 
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Thus, by extracting one of the variables j from the conditioning set vUj, one can express 
Fj|„uj as a function of two conditional distributions F^ v and Fj\ v with one conditioning 
variable less. Likewise, F^ v and Fj\ v may be written as bivariate functions of conditional 
distributions with a conditioning set reduced by one. Proceeding in this way, one finally 
obtains recursive functions of the margins. 

The type of pair-copula construction determines which conditional distributions are 
needed. At level j > 2 of a D-vine, these are the pairs (F^ Vi . (a^|xt, if ), F i+ j\ Vi . (xi+j |x„,y)), 
i = l,...,d — j. Now, define the functions 



, , n _ ^i,i+j\vij i u ii u i+j) 

^i,i+j\vij \ u i} u i+j ) — 



H+j,i\vij (Ui+j, - v — 

for i = 1 , . . . , d — j , j = 1 , . . . , d — 1 . Using (9) , one obtains 

Fi\vij ( x i \^-Vij ) 

= \\vi,j-\ (Fi\v it j-i i x i \ x Vi,j-i )j ( x i+j—l l x -Ui,j-i ))' 

Fi+j\vij \ x i+3 | x u»,j_i ) 

= hi+j,i+l\v i+ ij-i (Fi+j\v i+ i,j-i K x i+j l x «i+ij_i )j Fi+l\v i+ ij-i { x i+l | x Ui + i,j_i ))j 

which are functions of conditional distributions constituting the arguments of the copulae 
from the previous level, j — 1. As one continues this recursion, one achieves, as earlier 
mentioned, functions of the margins Fi, . . . , -Fi+j. Since these are needed in the asymp- 

totics, we denote them S^V+j an d 9i}+ji an d explicitly define them below. Note however, 
that for all practical purposes, such as in the estimation algorithm (Algorithm 1 in 
Supplement A of Hoback Haff [20]), one will use the nested /i-functions from (10). Define 



• u j 



(10) 



9i%j (ui,...,u i+j -i) = F i{vi . (F l 1 (u i )\F l+ \(u t+1 ),..., F i+ ) _ x (u i+j -i )) , 
9i%j(u i+ i,. . . , Ui +j ) = F i+j \ Vij (F^Ui+j^Fr^iui+i),. . . , i^_ 1 (<u i+ .,--i)) 
for i = 1, . . . , d — j, j = 1, . . . , d — 1. Now, one may rewrite (2) as: 
fi...d{.xi, . . . ,xa;cx,8) 



(11) 



Y[fi( x V,ati) 

1=1 

d-Xd-j 

■ \ }c iii+J -|^.(#V7 + ^ (12) 



3 = 1 i=l 



9 it i + j(Fi + i{x i+ i; ai+i), . . . , Fi + j(xi + j-,ai + j); i+ i^ i+ j); 9i,i+j\ Vij )• 



Parameter estimation for pair-copula constructions 9 

Recall that IFM estimates IFM are obtained by plugging the estimated marginal pa- 
rameters Q! IFM into the function lc ■ Semiparamctric estimation consists in replacing the 
parametric marginal c.d.f.s Uj = Fj(xj] atj) in lc with the corresponding empirical ones 



1 - 



1(A) 



fe=i 

1, if A is true, 
0, otherwise. 



The resulting pseudo log-likelihood function Iq p(0;x), given by 

Jc,p(0;x) 

n 

= y^log(c 1 ...d(Fi n (xi k ), . ..,F dn (x dk );9)) 
fe=i 

n d—ld—j 
fe=l j=l i=l 

(2) 

is just a function of 0. To obtain the semiparamctric estimator 9 , one simply maximises 
Zc,p(0;X) with respect to 0. 

Returning to the four-dimensional Student's i-vine of Section 2.2.1, SP estimation 
of this model requires a preliminary computation of the so-called pseudo-observations 
Uik.n = Fi n (xik), i = 1,2, 3,4, k = 1, . . . , n. The estimate 8 SP is obtained by maximising 
Zc,p(x;0), in this case 

n 

,71) ^2fc,7lj 

ia )) + . •• 

fc=i 

+ I0g(ci4|2 3 (5l4 ( u lk,n, ■ • ■ , U 3 k,n\ #l->3), ffi 4 ( u 2fc 02-^-4); ©14|23))), 

with 0i^3 = (0i2, ©23, #1312) an d #2^4 = (023, ^34, #24|3), over 9. 

In addition to the assumptions made for the ML estimator, assume that c\...d fulfills 
condition (A.l) from Tsukahara [40]. Then, the procedure corresponds to solving 

n 

- V <t> SP (F ln (X lk ), . . .,F ln (X dk ); 9 SP ) = 0, 

rj ^ — ' 

with 



n 
fc=i 



0O-i)(d-j/2)+ 4 ( M i, • ■■,u d ;0) = — (13) 

oa i,i+j\v i: j 



for i= j, j = l,...,d - 1. 
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Let U be a d-variate stochastic vector distributed according to the copula Cy...d(u\, . . . , 
Ud',0), and define 

SP , - f d 2 \ogc 1 ... d {u l7 ...,u d ;8) 
W| [U;0) = J dQdu I{U j <u j )dC 1 ... d {ui,...,u d ]0). 

Further, dchnc 

Bf = Var Wf (U; 0)) + £ Cov(0 SP (U; 6), Wf (U; 0)) 

= Var f £ W f ("J »)) + E Cov( 91 ° SCl ^ (U;0) , Wf (U; 0)) , 



where A = Cov(Y,Z), for two stochastic vectors Y and Z, is the matrix with elements 
Aij = Cov(Yi, Zj) + Cov(Yj,Zi). The matrix B^ p quantifies the effect of replacing the 
parametric marginal c.d.f.s with empirical ones. According to Genest et al. [16] and 
later shown by Tsukahara [40], SP is, under the mentioned conditions, consistent and 
asymptotically normal: 

(14) 

xrSP -7—1 , T-iRSPT-l v ' 

V - Xg +Xg tig ±g . 

Due to the completely separate and independent estimation of marginal and dependence 
parameters, the semiparametric estimator is more robust to misspecification of the mar- 
gins than ML and IFM (Kim et al. [27]). If cither of the latter two produce estimates 
that are rather different from the former, it indicates that the chosen margins or copulac 
are inadequate for the data. 

Computationally, SP is comparable to IFM. Hence, for high-dimensional 9, although 
faster than ML, this procedure will require good start values, and may still be too de- 
manding for PCCs. 



3. PCC parameter estimators 

If the number of PCC parameters is high enough, the estimators considered so far will 
be computationally too heavy. In any case, they necessitate appropriate start values. The 
next estimator, designed for pair-copula constructions, addresses this particular issue. 

3.1. Stepwise semiparametric estimator (SSP) 

As in semiparametric estimation, the marginal parameters are handled separately, and 
the parametric margins in the PCC log-likelihood function Iq are replaced with the non- 
parametric ones. The idea is to estimate the PCC parameters level by level, conditioning 
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on the parameters from preceding levels of the structure. Define 

n j 

lc,p,j(0i,- ■ ■ , Of, x) = ^2 ^2 M F in(xik), • • • , F dn (x dk ); 0i, . . . , t ), 



k=l 1=1 



with 



ipj(ui,...,Ud;d 1 ,...,dj) 

d-j 

= ^\og{c iti+j]v .. (g$ +j (ui, . . . , (15) 



i=i 



(2) 

9i,i+j{ u i+l, ■ ■ • j u i+ji0i+l-H+3j\6i,i+j\v tj )) 

for j = 1, . . . , d — 1. Hence, /c.p.j is the sum over all log pair-copula densities up 
to, and including, level j. To obtain the parameter estimates 9j P for level j, 
one plugs the estimates 0f SP , . . . , 0^ from preceding levels into lc,p,j an d max- 
imises it with respect to 6j. Assuming the standard conditions for the ML estima- 
tor are fulfilled (see Section 2), this corresponds to solving the estimating equations 
I ELi 4> SSP (F ln (X lk ), . . . , F dn (X dk ); SSP ) = 0, with 

Ft j 

<t>(j-i)(d- 3 /2)+i( u i, ■ ■■,u d ]8 1 ,...,6 j ) = — 2^ipi(ui, . . . ,u d ; 0i, . . . , 6i) 

OU i,l+j\v tj , -, 

a (16> 

ipj(ui,...,ud;0i,---,0j) 



^0i,i-t- - ?'|i; 



for i = 1, . . . , d — j, j = 1, . . . , d — 1. Compared to the SP equations (13), the full log 
copula density logci...^ is now replaced by the sum of log copula densities up to, and 
including, the level the parameter belongs to. The corresponding estimation procedure is 
presented in Algorithm 1 (Supplement A of Hoba;k Haff [20]). If none of the pair-copulac 
constituting the structure share parameters, which will usually be the case, the estimating 
equations are reduced to dg ^ log(ci ,i+j\ Vij )■ This means that the optimisation is 

performed for each copula, individually. 

Let us return to the four-dimensional D-vine considered in Section 2.2.2. As 
in the SP procedure, one computes the pseudo-observations Uik. n — Fi n (xik), i = 
1,2,3,4, k = l,...,n. One starts with the level 1 parameters, estimating each of 
the pairs (pi,i+i,v iti+1 ) by optimising Y^k=i^ S( c i,i+i( u ik, ni u l+ i.k,n\ Pi,i+i^i,i+i))-, for 
i = 1,2,3. One subsequently computes the copula arguments for level 2, Ujii+i^n = 

hi,i+l{Uik,n , Ui+i t k,n', Pi,i+1 , ^i.i+l)) and U i+2 \i+l,k,n = hi+2,i+l (Ui+2,k,n , Ui+l,k,n pi+l,i+2, 

£4+1,4+2)), i = 1,2,3, k = 1, . . . ,n, by plugging the resulting estimates into the adequate 
^.-functions (10). At level 2, one estimates each of the pairs (pi : i+2\i+ii v i,i+2\i+i) ■> f° r 
i=l)2, by maximising YJk=i lo g( c M+2K+i ( u i\i+i,k,n, v> i+2 \i+i,k,n> Pi,i+2\i+i, ^,i+2|i+i))- 
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Next, one computes the copula arguments Ui|23,fc,„ and u 4 \ 2 3.k,n f° r l eve l 3 by plugging 
the estimates from level 2 into h 13 \ 2 and /i 2 4|3 ■ Finally, to obtain (p 14 | 2 3 , £14123)7 one 

Optimises Yl=l l°g( c 14|23("l|23A"> M4|23,fc,«; Pl4|23, ^14|23))- 

When some of the copulae share parameters, the procedure is a little differ- 
ent. Let us for instance consider a four-dimensional Student's t-copula with corre- 
lations (pi2, P23, P34, Pi3j P24j P14) and v degrees of freedom. This is also a D-vine 
consisting of Student's £-copulae (see for instance Min and Czado [33]). The cor- 
relation parameters of these copulae are now the corresponding partial correlations 
(/°i2,/023,/034,/Oi3|2>P24|3>Pi4|23)- However, the degrees of freedom parameter is shared. 
More specifically, it is v for the three copulae at the ground level, v + 1 at level 2 and 
v + 2 for the top level copula. The SSP estimation procedure is now as follows. Having 
computed the pseudo-observations, one maximises the level 1 function 

n n 3 

^ ?Pl (Ulk,n, ■ ■ ■ , Uik.n, Pl2, P23, P34, v) = ^ log(Cj, i+ i (Ui ; k,n,Ui+l,k,n, Pi,i+1 , »)) 
k=l k=l i=l 

over (pi2,P23,P34,v). Then, one calculates the copula arguments for level 2 as described 
above. At the second level, one estimates P13 and p 2 4, which are not shared by c 13 | 2 and 
c 24 | 3 . More specifically, one optimises each of J2k=i lo g( c M+2|i+i («i|i+i,fc,n> M»+2|i+i,fc,n; 
Pi,i+2, Pi,i+i, Pi+i,i+2,t>)) , over Pi,i + 2, i = 1,2 (note that fci+i, pi+i,i+2 are needed to com- 
pute the partial correlations P;,i+2|i+i)- Next, one computes the copula arguments for the 
top level copula, and finally, one maximises Y2=i l°g( c i4|23Cwi|23,fc,nj U4|23},fc, n ; P14, P12, 
■ ■ ■ 1 P24|3> ^)) over P14. Note however that although it is possible to estimate the parame- 
ters of a multivariate Student's i-copula as described above, it is unnecessarily complex. 
In practice, one would typically estimate the correlation parameters via the correspond- 
ing Kendall's r coefficients, and subsequently optimise the pseudo log-likelihood function 
lc,p over v, plugging in the estimated correlations, as described in for instance McNeil 
et al. [32] (page 231). The main purpose of the PCC is to model pairs with different 
behaviour. If one does not really need that flexibility, then using a PCC is like using a 
sledgehammer to crack a nut. 

Let us now consider conditions (A.1)-(A.5) from Tsukahara [40]. The last four of 
these are the standard conditions for the ML estimator, but on the score functions (16). 
Further, define 

d 

<l>(j-W-j/2)+i (u; 0i > • ■ • , fy) = qq — — V>j 0; 0i , • ■ • , 0j ) = il> ij>e (u; 0i , . . . , 0,-) , 



d 

— 4> u ^ 1)(d „ j/2)+l {u; 1 ,...,0 j ) = —ipijjiu; 0i, . . . , 0j) = V>*j,<?,u* (u; 0i, • • • , 0j). 



Let Q and 1Z be the sets of positive, symmetric, inverse square integrable functions on 
[0,1] and reproducing u-shaped functions on [0,1], respectively, as defined in Tsuka- 
hara [40]. Further, let \0ij\ Vi . | = hj be the number of parameters of the pair-copula 
@i,i+j\vf F° r the SSP estimator, Condition (A.l) may then be phrased in the following 
way (note that a subscript i j' on (ft is missing in Tsukahara [40]). 
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Condition 1. For each 9, xf> ije = (V>y,e,i, • • • , i>ii,e, hj ) and ^ijfi^k = {^ij,e,u h ,i, ■ ■ ■ , 
V , ij,9,«fe,i-t,))i = — l,i = 1, — j are continuous, and there exist functions 

r ij,k,fij t k G 72. and qij t k G Q, sucft i/iai 

d 

|^-,e,m(u;0i,...,0j)l ^ H r «,«( u 0! 
\ipij,e,u k , m (u; Ox,..., 6j)\ < f ijik (u k ) nj t i(ui) 

l^k 

for k, I = 1, . . . , d, j = 1, . . . , d — 1, i = l,...,d — j, m = 1, . . . , l%j , with 

J ^JJ/ij,l( u l)j dCi...d(ui,...,u d ;6) < oo, 
/ f qij,k(uk)hj,k{ u k) Y\. r vA u l) ) dCi...d(ui,...,u ci ;0) < oo. 

When none of the pair-copulae share parameters, Condition 1 becomes a condition on 
each of them, individually. 

Once more let U be distributed according to Ci... d (ui, . . . ,u d ;6), as well as xj) g = 

(ipii,e,---," t Pi,d-i,e) and ^e,uj = {'4 } ii,e,up---^i,d-i,e,u s )- Defme 

Wf p (U;0) = J ■^-cf> SSP (u 1 ,...,u d ;0)I(U J <u J )dC 1 ... d (ui,...,u d - 1 0) 

= J ^6»,Mj Kv, u d ; 9)I(U.j < Uj)dCi...d{ui, ...,u d ;8) 

and the matrix 

Bf p = Var (]T Wf P (U; 0)") + ]T Cov(0 SSP (U; 0), Wf P (U; 0)) 



< d 



d 



Var J2 Wf P (U; 0) + £ Cov(^(U; 0),Wf p (U: 0)). 



Moreover, define the two matrices 



7C 9 = E(</> SSP W, SS T) 



V 



T Xe.d-l.d-l' 
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E - 



Je 



where the blocks 7Q 



d<jy 



SSP 



dO 1 



( 3 9,1,1 



e,d-2,i 



»7 e,d-2,d-2 
1-6,d-\,d-2 1-6,d-\,d-\) 



E(( *kv^VT 



correspond to each of the construction's levels. The block diagonal and block lower tri- 
angular forms of K-e and jTg, respectively, follow from the structure of the estimating 
equations (see Appendix A.l). More specifically, the tp functions depend on all the param- 
eters from previous levels but not from following levels. Further, the estimating equations 
for the top level copula parameters are based on the full copula, as for the SP estimator. 
This accounts for the appearance of blocks from the Fisher matrix Xg in the last rows 
of K-e and Se- If all pair-copulae are from one-parameter families, then 7C# and jTg are 
d(d- l)/2 x d(d- l)/2 matrices. 

We now have all the necessary components to establish the asymptotic properties of 
the stepwise semiparametric estimator. 

Theorem 1. Under Condition 1, as well as Conditions (A. 2)— (A. 5) of Tsukahara [40], 
the SSP estimator 9 SSP is consistent for 6 and asymptotically normal: 



-E(. 



h3 



1, 



.,d-l, 



V 



SSP 



■AA(0,V SSP ), 
Jo 1 K. g {J g 1 ) T 



-l\T 



(17) 



Proof. Theorem 1 follows directly Theorem 1 of Tsukahara [40], with the estimating 
equations (16). Note that Theorem 1 of Tsukahara [40] is valid for the multiparameter 
case m > 1, despite some misprints and imprecisions in the original paper. Specifically, 
Condition (A.l) is assumed valid for every element of the vector <p of estimating equa- 
tions, that is, a subscript 'j' is needed. Proposition 3 holds for m > 1 thanks to the 
Cramer- Wold device. The rest of the argument works for m > 1, using || • || instead of | • | 
for the norm (Tsukahara [41]). □ 

In order to construct confidence intervals for 9, one needs a consistent estimate of 
V SSP . As noted in Tsukahara [40] , one may estimate this covariance matrix consistently 
by replacing expectations and variances in (17) by sample equivalents, and plugging in 
the estimate 6 SSP . More specifically, letting %l) e e = g|^i/> e W SSP = Ylj=i W^ SP , and u ik , 
i = 1, . . . , d, k = 1, . . . , n, are the pseudo-observations, 

V SSP = Je l t e {J^) T + J^Bf^JjT, 

with 

1 ™ 

K = - V ^ fl (« lfc> . ■ • , u dk - 9 SSP )(iP e (u lk ,. . . , u dk - SSP )) T , 
fc=i 
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- 1 \ - , -SBF 

Je = — > ip 6 Auik,---,Udk;0 ), 
n z — ' 

fe=l 

Bf p = I T W SSP (u lk , ...,u dk ; 9 SSP )(W SSP (u lk , ...,u dk ; SSP )f 
n £ — ' 

fe=l 

1 n d 

+ - E E <M«i*. • • • > e SSP )(w SSP ( Wlfe , ...,u dk; ssp )f . 
n *!=i j=i 

In most cases, there is no analytic expression for the derivatives ip g and %p g g , but they 
can be approximated numerically. However, the computation of the Wj-vectors involves 
G?-dimcnsional integrals, which is more problematic. In practice, one will not be able 
to compute the above covariance matrix estimate for d > 3. Instead, one will have to 
resort to some resampling technique, such as parametric bootstrap from Ci.. .<$(•; 9 SSP ), 
as described in Example 4.4 of Section 4. 

Theorem 2. Under the conditions of Theorem 1, the SSP estimator 9 SSP is asymptot- 
ically semiparametrically efficient for the parameters 9 of the Gaussian copula. 

The proof is given in Appendix A. 2. 

In general, the stepwise semiparametric estimator SSP is asymptotically less efficient 
than 9 SF , since it at a given level discards all information from following levels. Nonethe- 
less, the levelwise estimation significantly improves the computational efficiency. The SSP 
estimator is therefore adequate for medium to high-dimensional models, and to produce 
start values for the SP estimator. Further, a substantial difference between SSP and SP 
estimates may be a sign that the copulae are unsuitable. Hence, one may use the SSP 
estimator to assess the sensitivity to the chosen copulae. Moreover, it is inherently suited 
for determining an appropriate PCC for a data set, which consists in choosing an order- 
ing of the variables and a set of parametric pair-copulae in a stepwise manner. Once the 
ordering is fixed, one finds suitable copulae for the ground level, based on the pseudo- 
observations. At the second level, the necessary pair-copula arguments are obtained by 
transforming the pseudo-observations with the adequate /i-functions, which depend on 
the chosen ground level copulae. This requires ground level parameter estimates, which 
can be provided by the SSP estimator. After one has selected copulae for the second 
level, one proceeds in the same manner for the remaining levels. Of course, one could 
construct a similar, stepwise estimator with a different transformation to uniform mar- 
gins, for instance using the parametric margins as in IFM estimation. That particular 
estimator was in fact proposed by Joe and Xu [26]. 

3.2. Robustness 

The SSP estimator is a substantial improvement over the three former in terms of com- 
putational speed. However, it presupposes that the specified model is the true one. If 
the amount of data available is high enough, it should, in most cases, be possible to find 
adequate marginal distributions. For the pair-copulae, the task is more complex. Using 
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the pseudo-observations, one may obtain a reasonable model for the ground level. Subse- 
quently, however, one must condition on choices from previous levels, as described above. 
One would therefore expect the quality of the model to decrease with the construction 
level. 

SSP estimation consists in replacing the parametric margins in the function Iq with the 
nonparametric ones, while keeping the parametric forms of the conditional distributions, 
that is, the (/-functions (11). The resulting estimator is robust toward misspecification of 
the margins, but not of the pair-copulae. By replacing also the conditional distributions 
with nonparametric versions, one would reduce this sensitivity to chosen pair-copulae 
preceding in the structure. One possibility is the empirical conditional distribution pro- 
posed by Stute [39]: 

71 + 1 2^fc=i-^s(x„ - x„ ife ) s l \s 

where I is the dimension of x„, K is a kernel function on M. 1 and s the bandwidth 
parameter. The definition (18) is slightly modified here to avoid boundary problems in 

and 1. Provided h—tO and ns l — > oo as n — > oo , it converges almost surely to the true 
conditional distribution, though at a rather slow pace of order [ns 1 ) 1 / 2 . The quality of the 
estimates will therefore significantly decrease with the level number. Alternative versions 
of the empirical conditional distribution function, such as the one proposed by Hall and 
Yao [18], share this unfortunate property. 

Recall that the conditional distributions of interest are recursions of h- functions (10), 
which, in turn, are conditional distributions of uniform variables with a conditioning set 
of length one. These functions can therefore be estimated nonparamctrically by (18) with 

1 = 1. Seemingly, one can exploit this to avoid the curse of dimensionality. However, the 
two arguments of the ^.-functions are again /i-functions from the preceding level. Hence, 
the error propagates from level to level, and as expected, the resulting rate of convergence 
is of the same order as for the original variables, that is (ns 1 ) 1 ^ 2 . 

Accordingly, the estimator suggested above becomes unreliable already at the fourth 
or fifth level of the structure, depending on the amount of data. Since the intention is to 
improve the quality of estimates at higher levels, it is in practice useless, unless the rate 
of convergence is increased by additional assumptions on the conditional distributions. 

3.3. C-vines 

For simplicity, we have only considered D- vines so far. The same results are however easily 
obtained for C-vines (see Figure 1) and more general regular vines (though computation 
of the log-likelihood function is more complex (Dissmann el al. [12])). 
The p.d.f. of a C-vine is given by (Aas et al. [1]) 

fi...d(xi,...,Xd',a,6) 

d 

i=i 



Parameter estimation for pair-copula constructions 



17 



(19) 

d-ld-j 

' ]^[ Y\. C j>3+i\zj-i,a (-^J>3-i,o ( x jl x Zj-i,o ! OL z j0 , 9j-<)), 
3 = 1 »=1 

■^j'+i|«i-i,o ( x j+i| x 2:j-i,o i a Zji > %,j+i|a 3 _i,o )) 

where z i? = {1, . . . , j - 1, j + i} and 9j_i = {0 s , s+i | Zs _ ljQ : (s,s + t) € Zji}, for 0<i<d-j, 
< j < d — 1, with zoi = 0, zii = {i + 1} and <?o_; = Qij, = - Hence, the log-likelihood 
function of n independent observations from a C-vine is 

n 

Z(a,0;x) = y^log(/i„,d(a:ifc, . . . , x dfc ; a, 0)) 
fe=i 

n d 

fc=i i=i 

n d—ld—j 

+ EEE lo g( c jj+i|*i-i,o (^ki-i.o (*3l x »i-i,o i «*joi fyjo). (20) 
fc=l j=l i=l 

■^i+t|*f_i,o W+iNzj-i.o i a Zji ) @jJ+i\zj-i,o)) 

= Z M (a;x) + l c {a,8;x). 

Replacing ?c from (4) with lc from (20), one retrieves the results from Section 2 for 
C-vines. To achieve the SSP estimator, one must simply replace the psi-function (15) in 
the estimating equations (16) with 

ipj(ux,...,u d ;Oi,...,dj) 

d-j 

= ksfej-Ml^-i.o ( F 3\z 3 -i,o ( X 3 l X *J-i,o i «« J0 » J-o)> ( 21 ) 
i=l 

^j-H|*i-i,o Ofy'-H l x Zj-i,o i ) ^j'j+i|zj-i,o))- 

Also, the ^-functions (10) are redefined as 

ft j-Nj|*i-i,o( u i-H> u j) = ^ ( 22 ) 

for i = l,...,d — j, j = 1, . . . , d — 1. The estimation procedure for a C-vine is described 
in Algorithm 2 (Supplement A of Hobask Haff [20]). 



4. Examples 

To compare the four estimators' performance, we have carried out asymptotic computa- 
tions on a few examples (Examples 4.1 to 4.3). We have also fitted a D-vine to a set of 
precipitation series, using each of the estimators (Example 4.4). 
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Example 4-1- Consider the three-dimensional Gaussian distribution 

XA \ ( I p 12 p 13 \ 

X 2 ~JV 3 (0,SRS), S= -T 2 , R = \ P i2 1 P23 ■ 
*s/ V a 3 J \p 13 p 23 1 / 

This distribution can be represented by a D-vine consisting of Gaussian pair-copulae and 
margins, more specifically 

3 

/i23(^i, x 2 , x 3 ; S, R) = c{ui,u 2 ; pi2)c(u 2 ,u 3 ; p 23 )c(u 1 \ 2 ,u 3 \ 2 ; p 13 \ 2 ) J| f{x l \a l ), 

i=l 

where 

cxp{-p/(2(l - p 2 ))(p^>- 1 (u t ) 2 + p^juj) 2 - 2$- 1 (^)3>- 1 K))} 
exp{-a; 2 /(2g 2 )} p 13 ~ P12P23 

v*iv ' Pl3|2 ~ ^(i-p{ 2 )(i-Pi 3 y 

h ij (u i ,u j ;p) = $^ yY^2 J' 

with Mi = $ being the c.d.f. of the standard Gaussian distribution. Note that this 

is one of the three possible decompositions of /i23- 

In practice, there are scarcely any other models for which it is feasible to do all com- 
putations analytically. It is also one of the few distributions the IFM and SP estimators 
are asymptotically efficient for, as explained below. 

The ML estimators cc ML and ML are of course the empirical standard deviations 
and correlations, respectively. It is easily verified that for this particular model, the IFM 
estimators a IFM and Q lFM are identical to a ML and ML . Thus, they are asymptotically 
efficient. Moreover, the SP estimator 8 SP is semiparametrically efficient for 0, as shown 
by Klaassen and Wellner [28]. 

For SSP, we must compute the matrices TCg, J'q and Bg SP , defined in Section 3.1. The 
covariance matrix 

v ssp of 

Pi2,P23 and pi 3 , in that order (corresponding to the PCC 
levels), is shown in Appendix A. 3, along with V ML . We see that V SSP = V ML = V SP . As 
the SP estimator is asymptotically semiparametrically efficient for 6, so must the SSP 
estimator be. 

Example 4-2. Consider the three-dimensional PCC with exponential margins and Gum- 
bel pair-copulae: 

3 

fi23{xi,x 2 , x 3 ; A, 6) = c(ui,u 2 ; S 12 )c{u 2 ,u 3 ; <5 2 3)c(u 1 | 2 , m 3 | 2 ; <5 13 | 2 ) /(a;*; Aj), 



f{x;cx) 



H\j 
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Table 2. Asymptotic relative efficiencies of S12 and 5 13 | 2 from Example 4.2, for various param- 
eter sets 







IFM 




SP 




SSP 




5l2 


5l 3 \2 


S12 


#13|2 




<5l3|2 


(5l2 ,5l3|2) 


= (1.2,1.2) 


0.997 


0.997 


0.921 


0.955 


0.904 


0.953 


(<Sl2, <5i 3 | 2 ) 


= (1.2,2) 


0.985 


0.996 


0.902 


0.984 


0.891 


0.981 


(<5l2,<$i3| 2 ) 


= (1.2,3) 


0.971 


0.994 


0.846 


0.990 


0.837 


0.987 


(5l2,<5i3| 2 ) 


= (2,1.2) 


0.995 


0.985 


0.913 


0.851 


0.879 


0.843 


{Si2,3 13 \ 2 ) 


= (2,2) 


0.981 


0.983 


0.896 


0.950 


0.850 


0.936 


(8i2,S 13 \ 2 ) 


= (2,3) 


0.956 


0.969 


0.832 


0.976 


0.815 


0.962 


(8i2,8 13 \ 2 ) 


= (3,1.2) 


0.995 


0.974 


0.912 


0.814 


0.861 


0.808 


(8i2,5 13 \ 2 ) 


= (3,2) 


0.973 


0.954 


0.871 


0.921 


0.843 


0.887 


(Si2,6 13 \ 2 ) 


= (3,3) 


0.944 


0.932 


0.825 


0.951 


0.777 


0.931 



where 



exp{-(Gf + tyVSKujUj)*- 1 ^ + v*) 1 ' 6 + 5-1) 
UiUj(uf + u s j ) 2 - 1 / s 



exp{-(u? +u s j ) 1 / 5 (u j ) s - 1 } 
f(x;X) = Aexp{-Aa;}, u l{j = hij(ui,Uj;6) = s ~su-i/s > 

with Uj = 1 — exp{— XjXj} and Uj = — log(uj),i,j = 1, 2, 3. For various parameter sets, we 
have computed the covariance matrices by numerical derivation and integration. Since 
the dependence parameters 5 are our primary interest, we let Ai = A2 = A3 = 1 in all sets. 
Moreover, we let S12 = 623- Table 2 shows the resulting asymptotic relative efficiencies of 
the ground and top level parameter estimators, (^12,^23) and 5 13 | 2 , respectively, that is, 
the ratios between the variances of the ML and alternative estimators in question. In a 
Gumbel copula, the dependence increases with the parameter 6. Kendall's r is when 
5 = 1 and tends to 1 as 5 — >• 00. The three estimators are rather efficient in general, with 
IFM on top, followed by SP and finally SSP. As the true margins are known, this is not 
that surprising, and agrees with the results of Kim et al. [27]. All three estimators lose 
asymptotic efficiency with increasing dependence at the ground level, that is, for i5i2 and 
623 , whereas SP and SSP gain efficiency at the top level. The asymptotic variances of all 
three estimators actually decrease with increasing dependence at both levels, though not 
as fast as for ML. As expected, SSP is overall less efficient than IFM and SP, but the 
difference is quite small at the top level. 



Example 4-3. Consider the five-dimensional D-vine with Student's i-margins and Stu- 
dent's i-copulae: 

5 4 5-j 

fl2345(xi,X 2 ,X 3 ,X4, X 5 ; V M ,p, U C ) = ^\_f{xi\Vi) ]J c(u i | l , ij , U i+j \ Vij ; Pi,i + j\ Vij , u i,i+j\vij ), 

1=1 j=l i=l 
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with u M = (i/i,...,i/ 5 ), p= (pi2,---,Pi5|234), "C = (^12, •■•,^151234), and 

/(*•„) = r((^ + i)/2) / 1 + ^- ( - +1)/2 



2 / v 2 / V ^ / V " 

«/ + r 



2 



1 , t- 1 K) 2 + t ^ ; 1 K^) 2 -2 /3 v 1 K)V 1 K-)^ (^y+2)/2 



^il^ij ^i^ilvij-i •> ^i-j-j— l\vi t j—i ) Pi,i-\-j— l\vij — i 1 ^i,i-\-j—X\vi t j—i ); 

M«, V ;p^) = Wi(^TT(t l ; 1 ( u )-^; 1 W)((^ + t 1 ; 1 M 2 )(i-P 2 ))~ 1/2 ), 

with Mi = t u (xi), t v being the c.d.f. of the Student's i-distribution with v degrees of 
freedom. This is a five-dimensional extension of the example model considered in Sec- 
tions 2.2.2 and 3.1, that is, none of the copulae share parameters, nor do the margins. 
The number of parameters is therefore 25. 

In this case, it is infeasible to compute the asymptotic covariance matrices, both ana- 
lytically and numerically Therefore, we resort to simulation and Monte Carlo methods. 
More specifically, we have generated N = 250 samples of size n = 10,000 from the above 
distribution with four different parameter sets. For each sample, we have estimated the 
PCC parameters p and vc using the four estimators. Finally, we have computed the sam- 
ple covariance matrices of the resulting estimates. The four parameter sets we have con- 
sidered are (p, v) = (0.3, 6), (0.7, 6), (0.3,20), (0.7, 20), where we let p 12 = ■ ■ ■ = P151234 = P, 
Vyi = ■ ■ ■ = ^i5|234 = v, fixing the marginal parameters at v\ = ■ ■ ■ = v$ = 6. Table 3 shows 
the resulting relative efficiencies averaged over each level. The three estimators behave 
rather similarly, although IFM once more appears to be the most efficient, SP the second 
and SSP the last. More specifically, their efficiency decreases with increasing dependence 
(cither higher correlation or lower number of degrees of freedom) at all levels of the 
structure. Furthermore, they all become more efficient with increasing level number. In 
particular, the SSP estimator gains with respect to its competitors at the higher levels, 
just as for the Gumbel vine in Example 4.2. Note that an increased efficiency is not 
synonymous with a lower estimator variance, but only measures the behaviour relative 
to the ML estimator. Actually, the variances of all four estimators increase with the level 
of the structure, as one would expect. 



Example 4-4- Finally, we have fitted a D-vine to a set of daily precipitation val- 
ues recorded from 01.01.1990 to 31.12.2006 at five different meteorological stations 
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Table 3. Asymptotic relative efficiencies of p and vc from Example 4.3, averaged over each 
level, for various parameter sets 



Level 1 Level 2 Level 3 Level 4 





























P 


V 


P 


V 


P 


V 




P 


V 










IFM 














(p> v ) 


= (0.3,6) 


0.988 


0.996 


0.988 


0.997 


0.998 





996 


0.997 


0.998 


U>, v ) 


= (0.7,6) 


0.935 


0.913 


0.961 


0.988 


0.984 





.979 


0.968 


0.996 




= (0.3,20) 


0.997 


0.996 


0.993 


0.999 


0.990 





.998 


0.997 


0.999 


(ft,v) 


= (0.7,20) 


0.952 


0.992 


0.962 


0.993 


0.969 





.988 


0.991 


0.989 










SP 
















= (0.3,6) 


0.952 


0.985 


0.973 


0.987 


0.991 





.992 


0.998 


0.997 


(P,v) 


= (0.7,6) 


0.872 


0.883 


0.915 


0.956 


0.974 





.965 


0.988 


0.977 


(P, v ) 


= (0.3,20) 


0.965 


0.963 


0.994 


0.970 


0.989 





.991 


0.983 


0.992 


(P, v ) 


= (0.7,20) 


0.938 


0.926 


0.966 


0.992 


0.958 





.985 


0.994 


0.990 










SSP 














(P,v) 


= (0.3,6) 


0.890 


0.907 


0.932 


0.937 


0.938 





.985 


0.946 


0.996 


(P,v) 


= (0.7,6) 


0.852 


0.855 


0.861 


0.934 


0.925 





.948 


0.967 


0.959 


(P,v) 


= (0.3,20) 


0.941 


0.955 


0.992 


0.964 


0.981 





.975 


0.973 


0.974 


(P,v) 


= (0.7,20) 


0.870 


0.911 


0.950 


0.990 


0.968 





.981 


0.980 


0.985 



in Norway; Vestby, Ski, L0renskog, Nannestad and Hurdal, shown on the map (Fig- 
ure 2, Supplement B of Hobaik Haff [20]). These data were provided by the Norwegian 
Meteorological Institute. Moreover, this is one of the data sets studied in Berg and Aas [4], 
extended with the series from L0renskog. We have followed their example and modelled 
only the positive precipitation, that is, we have discarded all observations for which at 
least one of the stations has recorded zero precipitation, leaving 2013 out of the original 
6209. The aim is to remove the temporal dependence between the observations, in ac- 
cordance with our assumptions. Autocorrelation plots of the resulting data set indicate 
that this is reasonable. 

Since rain showers tend to be very local, we expect the dependence between measure- 
ments from two proximate stations to be stronger than from stations that are farther 
apart. As the stations almost lie on a straight line (see Figure 2, Supplement B of Hobaok 
Haff [20]), a D-vine ordered according to geography is a very natural model. More specif- 
ically, the chosen dependence structure is the left-hand side of Figure 1, with Vestby, 
Ski, L0rcnskog, Nannestad and Hurdal as variables 1, 2, 3, 4 and 5, respectively. To 
find adequate copulae for our structure, we computed the pseudo-observations, shown in 
Figure 3 in Supplement B of Hoback Haff [20]. There are strong indications of upper, but 
not of lower tail dependence. We therefore chose Gumbel copulae at the ground level. An 
inspection of the data transformed with the estimated /i-functions from the preceding 
level (as described in Section 3.1) indicated that Gaussian copulae would be reasonable 
for the three remaining levels. Finally, according to histograms of the data (shown in Fig- 
ure 4, Supplement B of Hobaok Haff [20]), the generalised gamma distribution (Stacy [38]) 
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Table 4. Estimated parameters with 95% confidence intervals for the precipitation data set of 
Example 4.4 



Lev. Par. 


ML 


IFM 


SP 


SSP 


1 B 

1 012 


4.56 


4.37 


4.32 


4.32 




(A A A 4 71') 


(A 18 4 56") 


(A 1 4 4 Kffi 


(A 1 4 4 50") 


0'2,'> 


3.02 


2.92 


2.91 


2.90 




(2.91, 3.13) 


(2.80, 3.04) 


(2.79, 3.03) 


(2.79, 3.03) 


034 


2.53 


2.47 


2.47 


2.47 




(2.44, 2.62) 


(2.37, 2.57) 


(2.37, 2.57) 


(2.37, 2.56) 


045 


3.59 


3.48 


3.44 


3.44 




(3.45, 3.73) 


(3.34, 3.62) 


(3.30, 3.58) 


(3.30, 3.58) 


2 #13 1 2 


-0.17 


-0.17 


-0.17 


-0.17 




(-0.21, -0.13) 


(-0.21, -0.13) 


(-0.21, -0.13) 


(-0.21, -0.13) 


024\3 


0.21 


0.20 


0.21 


0.21 




(0.15, 0.27) 


(0.16, 0.24) 


(0.17, 0.25) 


(0.17, 0.25) 


035 1 4 


0.066 


0.067 


0.061 


0.061 


(0.022, 0.11) 


(0.031, 0.10) 


(0.023, 0.099) 


(0.024, 0.098) 


3 #14|23 


0.093 


0.088 


0.081 


0.081 




(0.055, 0.13) 


(0.053, 0.12) 


(0.044, 0.12) 


(0.044, 0.12) 


#25|34 


0.050 


0.043 


0.033 


0.033 




(0.009, 0.091) 


(0.008, 0.079) 


(-0.003, 0.070) 


(-0.003, 0.070) 


4 #15|234 


0.040 


0.045 


0.046 


0.046 




(0.006, 0.075) 


(0.007, 0.083) 


(0.012, 0.080) 


(0.012, 0.080) 



with p.d.f. 

/( ^ 7 ' Ap)= ^ife)^ lexp {-(i) P } 

appears to be suitable for the margins. This distribution is gamma for p = 1 and expo- 
nential if in addition 7=1. Both the ML and the IFM estimates of 7 and p were rather 
different from 1, which confirms that the margins are neither exponential nor gamma 
distributions. The actual fitted marginal p.d.f.s are shown in the histograms of the data 
(Figure 4). 

We have fitted the described model with each of the four estimators, using the R- 
routine optimO . The resulting estimates are shown in Table 4, along with 95% confidence 
intervals. These were computed by 9 ± $~ 1 (0.975)se, for each of the ten parameters, 
where se is an estimate of the parameter's asymptotic standard deviation. For the ML 
estimator, we computed the sample Fisher matrix 

1 " 3 
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where a = (7i,/3i,pi, . . . , 75, /?5,??5), the derivative being calculated numerically. The 
estimates se were then simply the square roots of the diagonal entries of p". For 
the three remaining estimators, we used parametric bootstrap to obtain se. More 
specifically, we generated B = 500 bootstrap samples from iq... 5(2:1, . . . , 215; a IFM , IFM ), 
Ci... 5(1*1, . . . ,u 5 ; SP ) and Ci... 5(1^1, . . . , 115; SSP ), estimating the parameters a IFM,b , 
^ipm,6 ^SP,6 ) gSSP,fe ) h = l,...,B. Finally, we let the ses be the sample standard devi- 
ations of the bootstrap estimates. 

At the ground level, the parameter estimates are overall high. This indicates a strong 
positive dependence between large amounts of precipitation in stations that are close in 
distance, as anticipated. The IFM, SP and SSP estimators give similar values. However, 
the ML estimates are rather different, though the 95% confidence intervals overlap with 
the other estimators'. As noted earlier, this indicates that the chosen univariate margins 
or copulae are not quite adequate. Since the SP and SSP estimates arc virtually the 
same, the problem appears to be the margins. 

The second level models the conditional dependencies of two stations that are separated 
by one, given the one between them. All four estimators agree that this conditional 
dependence is negative between Vestby and L0renskog, positive between the pair Ski 
and Nannestad, whereas L0renskog and Hurdal are almost conditionally independent. 
At the top two levels, the estimated copulae are close to the independence copula, as 
expected. Actually, the SP and SSP confidence intervals indicate that the copula C25134 
is not significantly different from independence, which can be an important aspect for 
practical purposes. 



5. Concluding remarks 

There are various estimators for the parameters of a pair-copula construction, among 
those the stepwise semiparametric estimator, which is designed for this particular depen- 
dence structure. Although previously suggested, it has never been formally introduced. 
In this paper, we have presented its asymptotic properties, as well as the estimation 
algorithm for the two most common types of PCCs, namely D- and C- vines. 

Compared to alternatives such as maximum likelihood, inference functions for margins 
and semiparametric estimation, SSP is in general asymptotically less efficient. The SSP 
estimator has a higher variance than the alternatives. Nonetheless, the loss of efficiency is 
rather low, and decreases with the construction level, as shown in a couple of examples. 
For the set of five precipitation series, the SSP estimates are actually almost indistin- 
guishable from the SP ones. Moreover, the SSP estimator is semiparametrically so for the 
Gaussian copula. To compare the alternative estimators' performance more thoroughly, 
we plan to perform a large simulation study. 

One of the main advantages of the SSP estimator, is that it is computationally tractable 
even in high dimensions, as opposed to its competitors. Moreover, it provides start values 
required by the other estimators. Finally, determining the pair-copulac of a PCC is a 
stepwise procedure, that involves parameter estimates from preceding levels. The SSP 
estimator lends itself perfectly to that task. 
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For simplicity, we have only considered C- and D-vines. Equivalent results are, how- 
ever, easily obtained for the more general class of regular vines. Further, we have par- 
titioned the parameter vector into marginal and dependence parameters. This excludes 
some distributions, such as the multivariate Student's t. However, if one does not need 
the flexibility to model the margins and dependence structure separately, as well differ- 
ent types of dependence between the various pairs of variables, a PCC is unnecessarily 
complex. Moreover, we have assumed the observations to be independent, identically 
distributed. In practice, the parameter estimation often includes a preliminary step to 
deal with deviations from these assumptions (Chen and Fan [5]), for instance GARCH 
filtration of time series data. The effect of such an additional step on the SSP estimator 
is a subject for future work. 

Appendix 

A.l. Matrices K-e and J e 

As stated in Section 3.1, the matrices K-e = ^(tpgtp^) and J~e = E(— ipg e ) are block 
diagonal and block lower triangular, respectively, that is, TCg, i.j =0,i^j and jTg.i.j = 0, 
i <j. This follows from the structure of the V'-functions, as shown below. 

We start with Jes,^ where i <j. Then, J~e,Lj = E(— 9 ^("l^-.Md.gi,— ^ w ith ^ 

from (15). Since none of the copulae at level i are functions of the parameters at a 
following level j, a^(«i.-^;gi,-.«0 = HcnCG; j g i j = i < j 

Assume now that i < j , and let u = (u±, . . . , ltd) = ( u w ki i u -w ki )- Then, 



dipi (ui , . . . , %d\ Ox , • • • , 0i) \ ( dipj (ui , . . . , Ud; 9i , ■ ■ ■ , 9 



d6i J \ 89 j 



'j 



d-i „ d—j 



= I ^^ l0 S C k,k+ t \v kl ^T^OgC lil+]lvij C 1 ... d du 
Ju 1 k=l °°3 1=1 



k=l 1=1 



ci,i+j\ Vl] d9 3 



Under the conditions of Theorem 1, we may exchange the integration and differentiation 
in the inner integral. Thus, 



K e,i,j = f W- lo & c *.*+i|« fci if/ c '-' +j| ^ Cl - d du_ u , fci ) du 

k=l 1=1 J ^ ° * d6 3 V»— M %t+Mi 



d—i d—j 



k=l 1=1 
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d—i d—j r Q Q 

J -QQ: lo ^ c k,k+i\v ki -^rC Whi AM Wki 



The pair-copulae composing c Wki , situated in levels 1, . . . , i, are not functions of param- 
eters from a following level j. Thus, -gg-c Wki = 0. Consequently, Ke,i,j =0,i < j. The 
exact same argument can be repeated for i > j . Hence, K-e,i.j = 0, i ^ j. 



A. 2. Proof of Theorem 2 



Proof. In two dimensions, the SSP estimator is the same as the SP estimator, which 
was shown to be semiparametrically efficient by Klaassen and Wellncr [28]. In three 
dimensions, we have computed the asymptotic covariance matrices for comparison. As 
shown in Example 4.1, the covariance matrices of the SP and SSP estimators, V SP and 
V SSP , respectively, are equal. Thus, the SSP estimator is semiparametrically efficient 
also for the three-dimensional Gaussian copula- 
Assume now that it is true for the (d — l)-dimcnsional Gaussian copula. Further, for 
the d-dimcnsional model, partition the covariance matrix V SP as ~Vf p = Vp p _ 2 1 d 2 , 
Vi2 P = V^d-^d-i and V 2 SP = V^ p d _ l7 and likewise for V SSP , V ML , B sp ,"b s s p ',' Xg, 
X^ and J~g. As the SP estimator is semiparametrically efficient, V ML for the Gaussian 
copula must be the same, regardless of the margins. Moreover, when all margins are 
normal, ML is simply the empirical correlation matrix. Adding an extra dimension 
leaves the remaining estimators unchanged. Hence, V^ L , corresponding to the [d — 1)- 
dimcnsional sub-model, will be the same as for the (d— l)-dimcnsional Gaussian copula. 
The same argument can repeated for all (d — l)-dimcnsional sub-models, covering all 
levels but the top. Due to its levelwise structure, the SSP estimator for a given sub- 
model is unaffected when adding an extra dimension, and so must the corresponding 
block of V SSP be. Accordingly, we must have Vf SP = Vf p = V^ L . Hence, it remains to 
show that V?f p = V? P and K, SSP = K SP , related to the estimators ^ p , and 9^ p 
for the top level copula. According to Theorem 1 from Tsukahara [40] and Theorem 1, 
respectively, 

and 

^$g M - ldWd ) A Z SSP ~ Af(0, ^ 2 SSP ), 

as n -> oo. Now, define U„ = (U„i, . . . ,U„„), with \J nj = (Fi n (Xij), . . . ,F dn (X d j)), 
j = 1, . . . , d, and let 

* SP (u„ ; e SP ) = ix:^ P (Un;e SP ) = o, 

n z — ' 

fc=i 

* SSP (U„;0 SSP ) = I Y> SSP (U„;0 SSP ) = 0, 

fe=i 
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be the estimating equations of the SP and SSP estimators, respectively. Further, let 

1 " n 

*(U n ;0) = *if d _ 1)/2 (U n ;0) =*sg* (u n ;fl) = -£ _ log( Cl ... d (U n ;0)). 



According to Theorem 1 from Tsukahara [40] 

^_, T ,, TT , ^(Un;e)^ SP 

'Id Did 



*(U n ; SP ) = #(U„; 0) f - * ldKJ 



+ ~7^T V U l^d-2 ~ + Op - ] - U. 

°' t 'l->d-2 \ n 

Likewise, using Theorem 1, one obtains 

*(U„;0 SSP ) = *(U n ;») + ^^(^ " 

.~) {ft P_ 2 _ ei _ !) + JI, 



■d-2 



Hence, 



with 



and 



^(«SS„ - J = ^(0?V 2 - «- 2 ) + op Q 



Ai = y = — > j, log(ci...d(U„; 0)) 

dd l^d-2 71 k =l 9(> ld\v ld dOl^d-2 



^ = = n 2- l0S(Cl - - d(U " ; 

According to the assumption, 

V»(^Va - ft-d-a) Y ~ -A/" d ( d _i)/2_i(0, Vf L ), n -> oo. 

Thus, 

— 0, n — > oo. 

Moreover, under the assumed conditions, Ai — — > — X^ 12 and A 2 — — > — 2e,2! a s ?i — > oo. 
Hence, 
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which means that Zgp = ■Z'ssp- In other words, V 2 SSP — V 2 SP . Moreover 



v ssp 

v 12 



/ V ML T , /t— 1 rSSPn 
T (~ V 1 -t-6,12 + c/ e 12^12 )y 

-1-6,2 



1 



1 / Bf P 
1 + — + 

1-8,2 V 2(9,2 2e i2 

1 



tT V ML T z -t-T ^—1 t>SSP 
X e,12 V l x fl,12 - ~ -t-e, 12«^ 8. 12 ±5 12 

-t-0,2 

rT -trSSP 



^2-^,2 +-D 2 ~ X e,12 V l X e,12j ~ ^fl,12 V 12 



Correspondingly for SP 
1 



v sp 

v 12 



. — Vj Xg i i2+X 1 I rS 12 — — Xe,12 

-t-0,2 V V -^6,2 



K SP = 



1 



Bf p 



?sp 



L -t-T V ML T 2 tT / R SP &2 

T- + ^ r- -tU2 V l X 0,12 - t ± 6.\2- L \ [ a l2 ~ t X 0,12 

-1-6.2 \ -1-6.2 -1-6,2 -1-6,2 \ -1-0,2 



1 



SP 



—^-(l g ,2+B 2 -X gi2 V 1 Ie,i2) - 2X e 12 V 12 
x e,2 



Since the estimating equation for 6\M Vli is the same for SP and SSP, B 2 SP 
Moreover, V 2 SSP = V 2 SP . Consequently, Vff p = Vf P . 



A. 3. Covariance matrices from Example 4.1 

The asymptotic covariance matrix of the ML estimator is given by 

1 / 2 ( 1 -/°12) 2 "13 "23 

V ML = M « 13 2(l-p| 3 ) 2 

2 \ "23 "12 2(1 - P \ 3 )\ 

with Vik = 2pik(l — — pjk) ~ PtiPlk\R-\- For the SSP estimator, we have 

■trSSP -r-ijT I T~ X \T i /t-IrSSP/ /x- 1\T 

V =.7 e J +J e B e (Jg), 

where 



/ 1 + Pf 2 



/Co = 



fcia 



(l-pf 2 ) 2 (l-pf 2 )(l-^ 3 ) 

fcl2 



1 + pjg 

(l-pf 2 )(l-pl 3 ) (1-P 2 23 ) 2 













\ 



|R|+2(pi3-pi 2 p 23 ) z 

IRI 2 



2N 
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Jo 



>SSP 



( 1 + P12 

:i-p? 2 ) 2 





V |R| 



J23 

R| 2 



1 + P23 

(i-p! 3 ) 2 
ii2 
IRI 2 



Pl2(! + Pl2) 



(1~P? 2 ) 2 
P23^12 + P12&23 - P12P23« 

2(l-p 2 2 )(l-p 2 3 ) 

(Pl3 - Pl2P23)bl2 





|R| +2(pi 3 -P12P23)' 

|R| 2 / 

P23^12 + P12^23 - P12P23" 

2(l-p 2 2 )(l-p 2 3 ) 

P23( 1 +P23) 

(i-pIs) 2 

(P13 - Pl2P23)k 2 3 



2(l-p 2 2 )|R| 



2(l-p 2 3 )|R| 



with a = 1 



P?2 



P?3 



2(p 4 ; - pikPik){pik - PiiPifc) and 
covariance matrix is 

/2(1-P 12 ) 2 

"23 



•P23> fc 12 = (P13 - Pl2P23)(|R| 



P?3 



Pika{l 



(Pl3 - Pl2P23)fel2 ^ 

2(l-p 2 2 )|R| 

(Pl3 - Pl2p23)&23 

2(l-p 2 3 )|R| 

( 1 + Pl3)(Pl3 -P12P23) 2 

|R| 2 

P 2 2P 2 3)> i« = -Pifc|R| 



• 2 ( 1 + P 2 fc) Pi '°i- The resulting 



"13 "23 

2(1 - p! 3 ) 2 "12 
w 12 2(1 -p 2 3 ) 2 



rML 
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